Unambiguous Prioritized Repairing of Databases

نویسندگان

  • Benny Kimelfeld
  • Ester Livshits
  • Liat Peterfreund
چکیده

In its traditional definition, a repair of an inconsistent databaseis a consistent database that differs from the inconsistent onein a “minimal way.” Often, repairs are not equally legiti-mate, as it is desired to prefer one over another; for example,one fact is regarded more reliable than another, or a morerecent fact should be preferred to an earlier one. Motivatedby these considerations, researchers have introduced and in-vestigated the framework of preferred repairs, in the contextof denial constraints and subset repairs. There, a priorityrelation between facts is lifted towards a priority relationbetween consistent databases, and repairs are restricted tothe ones that are optimal in the lifted sense. Three notionsof lifting (and optimal repairs) have been proposed: Pareto,global, and completion.In this paper we investigate the complexity of decidingwhether the priority relation suffices to clean the databaseunambiguously, or in other words, whether there is exactlyone optimal repair. We show that the different lifting se-mantics entail highly different complexities. Under Paretooptimality, the problem is coNP-complete, in data complex-ity, for every set of functional dependencies (FDs), exceptfor the tractable case of (equivalence to) one FD per rela-tion. Under global optimality, one FD per relation is stilltractable, but we establishΠp2-completeness for a relationwith two FDs. In contrast, under completion optimality theproblem is solvable in polynomial time for every set of FDs.In fact, we present a polynomial-time algorithm for arbitraryconflict hypergraphs. We further show that under a generalassumption of transitivity, this algorithm solves the problemeven for global optimality. The algorithm is extremely sim-ple, but its proof of correctness is quite intricate.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognizing Determinism in Prioritized Repairing of Inconsistent Databases

A repair of an inconsistent database is traditionally defined as a consistent database that differs from the inconsistent one in a “minimal way.” As there are often reasons to prefer one repair over another, researchers have introduced and investigated the framework of preferred repairs, where a priority relation between facts is lifted towards a priority relation between consistent databases, ...

متن کامل

A Diagnostic Approach to Repairing Constraint Violations in Databases

Repairing violations of integrity constraints in databases can be seen as an interleaving diagnostic/repair process. In this paper we introduce a new approach on repairing constraint violations by adopting existing techniques from model–based diagnosis. Violations of integrity constraints observed in an inconsistent database state are diagnosed and repair actions are deduced from diagnoses. By ...

متن کامل

A Diagnostic Approach to Repairing Constraint Violations in Databasesz

Repairing violations of integrity constraints in databases can be seen as an interleaving diagnostic/repair process. In this paper we introduce a new approach on repairing constraint violations by adopting existing techniques from model{based diagnosis. Violations of integrity constraints observed in an inconsistent database state are diagnosed and repair actions are deduced from diagnoses. By ...

متن کامل

CrowdAidRepair: A Crowd-Aided Interactive Data Repairing Method

Data repairing aims at discovering and correcting erroneous data in databases. Traditional methods relying on predefined quality rules to detect the conflict between data may fail to choose the right way to fix the detected conflict. Recent efforts turn to use the power of crowd in data repairing, but the crowd power has its own drawbacks such as high human intervention cost and inevitable low ...

متن کامل

A Diagnostic Approach to Repairing ConstraintViolations in

Repairing violations of integrity constraints in databases can be seen as an in-terleaving diagnostic/repair process. In this paper we introduce a new approach on repairing constraint violations by adopting existing techniques from model{based diagnosis. Violations of integrity constraints observed in an inconsistent database state are diagnosed and repair actions are deduced from diagnoses. By...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1603.01820  شماره 

صفحات  -

تاریخ انتشار 2016